Human Recognition System using Cepstral Information
نویسندگان
چکیده
This paper presents a new method for human recognition using the cepstral information. The proposed method consists in extracting the Linear Frequency Cepstral Coefficients (LFCC) from each heartbeat in the homomorphic domain. Thus, the Hidden Markov Model (HMM) under Hidden Markov Model Toolkit (HTK) is used for electrocardiogram (ECG) classification. To evaluate the performance of the classifier, the number of coefficients and the number of frequency bands are varied. Concerning the HMM topology, the number of Gaussians and states are also varied. The best rate is obtained with 32 coefficients, 24 frequency bands, 1 Gaussian and 5 states. Further, the method is improved by adding dynamic features: the first order delta (∆) and energy (E) to the coefficients. The approach is evaluated on 18 healthy signals of the MIT_BIH database. The obtained results reveal which LFCC with energy that make a 33 dimensional feature vector leads to the best human recognition rate which is 99.33%. Keywords— Electrocardiogram(ECG); Linear Frequency Cepstral Coefficients( LFCC); Hidden Markov Model (HMM).
منابع مشابه
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملVoice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملSpeaker recognition using the resynthesized speech via spectrum modeling
Recently, using prosodic information such as pitch and energy for speaker recognition has attracted much attention. However, these kinds of systems yield performance much worse than the traditional cepstral based systems. Limited performance improvement can be achieved when combining the two kinds of systems. In this paper, we present a new approach for speaker recognition, which uses the proso...
متن کاملAn information-theoretic analysis of developmental changes in speech
Developmental changes in the human speech production system signal age-dependent variability in the speech signal properties. In this paper, an informationtheoretic analysis of developmental changes in the speech signal is presented. The effects of age and signal bandwidth on speech signal features are analyzed especially motivated by implications to automatic recognition of children's speech. ...
متن کاملAn improved model of masking effects for robust speech recognition system
Performance of an automatic speech recognition system drops dramatically in the presence of background noise unlike the human auditory system which is more adept at noisy speech recognition. This paper proposes a novel auditory modeling algorithm which is integrated into the feature extraction front-end for Hidden Markov Model (HMM). The proposed algorithm is named LTFC which simulates properti...
متن کامل